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The localization of sound sources by the human brain is computa- 
tionally simulated from a neurobiological perspective. The simulation 
includes the neural representation of temporal differences in acoustic 
signals between the ipsilateral and contralateral ears for constant sound 
intensities (angular localization), and of volumetric differences in acous- 
tic signals for constant azimuthal angles (radial localization). The trans- 
mission of the original acoustic signal from the environment, through 
each significant stage of intermediate neurons, to the primary auditory 
cortex, is also simulated. The errors that human brains make in at- 
tempting to localize sounds in evolutionarily uncommon environments 
(such as when one ear is in water and one ear is in air) are then math- 
ematically predicted. A basic overview of the physiology behind sound 
localization in the brain is also provided. 
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1 Introduction 



The capacity of vertebrates to localize sound sources has been preserved 
over numerous generations of evolution due to the significant benefits it pro- 
vides regarding the detection of predators. The brain's ability to detect 
microsecond-long temporal differences in signals, and to use just this minis- 
cule article of information for calculating the locations of a sound source to a 
precision of a few degrees, is undoubtedly one of the most remarkable facilities 
it possesses. 

Learning how this fascinating mechanism operates from a neurobiological 
point of view has a number of promising applications to biomedical engineer- 
ing, such as designing aids for patients with damaged cells in the nuclei of the 
superior olivary complex. 

Additionally, gaining knowledge regarding the brain's mechanism for per- 
forming this task may provide insights into improving current synthetic sound 
localizers, which may someday lead to more practical and clever robots than 
those that we have designed thus far. Likewise, methods in acoustical engi- 
neering may draw upon such knowledge of our brain to allow technologies such 
as surround sound speaker systems to portray more realistic manifestations of 
sounds. 

Numerical experiments on a computer may also provide us with informa- 
tion that may be too expensive or time-consuming to extract out of physical 
experiments with live mammals. For example, Homo sapiens especially, are 
much better at locating their distance away from a sound source (distance lo- 
calization), and their angle with respect to the sound source in a 2-dimensional 
plane at ground level (azimuthal localization) , than at estimating the elevations 
of such sources (vertical localization). By constructing mathematical models of 
the brain's function that are able to reproduce available experimental results 
regarding distance localization and azimuthal localization, we may extrapolate 
to observe how our model performs at the task of vertical localization. We can 
then perform numerical experiments to better understand what may improve 
our vertical localization abilities, with the hope to one day be able to construct 
devices that allow pilots and air traffic controllers to better detect altitudes of 
sounds sources. 
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2 2-Dimensional Representation: Equidistant- 
Isovolumentric Signals 



I start by modeling the brain's mechanism of representing temporal dif- 
ferences between the left ear and the right ear's detection of an acoustic signal. 
I will use the term "detector" to denote the human subject that is detecting 
these acoustic signals. Here, the signals are kept at a fixed amplitude (they 
are isovolumetric), and they remain equidistant from the detector's ears, while 
the sound source's angle with respect to the azimuth^] is varied from to 2ir 
radians. See Fig. 1 below]^] 




Figure 1: A detector placed in the center of a 2-dimensional plane. The angle 
between the speaker displayed and the azimuth (0 radians) measures | radians. 

The term "2-Dimensional Representation" used in the title dictates that 
the representation discussed in this section is of the position of a sound source 
in the 2-D plane intersecting the detector's earsj^] 

This model uses the Neural Engineering Framework (NEF) developed by 
Chris Eliasmith and Charles H. Anderson [I]. It begins with a survey of what 
is known about the system (System Description). 

1 a straight line passing perpendicularly through the detector's coronal plane 
2 A11 angles reported herein will be measured clockwise from the azimuth and will be 
reported in radians. 

3 It should be noted that since there is no radial variance, the representation itself will be 
manifested in the form of a 1-D scalar (the angle between the sound source and the azimuth) , 
but since these angles contain information about both the x~ and y — coordinates of the 
source in a 2-D plane, the representation will be called "2-Dimensional." This distinction 
was made to avoid confusion with the following section. 
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2.1 System Description 



2.1.1 Connectivity and Functional Functional Description 

Sounds that reach the ear are transduced by hair cells, which create ac- 
tion potentials in neurons of the anteroventral cochlear nucleus. These action 
potentials are conveyed, via fibers of cranial nerve VIII, to neurons of each 
medial superior oliv^J which is where processing of temporal differentiations 
in acoustic signals begins [2]. See Fig. 2 below. 
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Figure 2: A display of the transmission of acoustic information from the envi- 
ronment to the location where processing of temporal differentiations occurs. 



The neurons of the medial superior olive do not reach the threshold re- 
quired for excitement unless they are stimulated by both (left and right) of 
the incoming neurons from the respective anteroventral cochlear nuclei simul- 
taneously. But the lengths of the axons of the contralateral cochlear nucleus 
vary in length so that only one particular axon sends the action potential to 
its respective receiver in the medial superior olive in time for its stimulus to 
coincide with that of the ipsilateral axon. Since the contralateral ear receives 
the acoustic signal later than the ipsilateral ear, and since this delay depends 
on the source's angle to the azimuth, different neurons of the medial superior 
olive will be activated for each possible location of the source^] For example, 

4 Thcre are two medial superior olives: one for the left ear and one for the right. 

5 When referring to a specific side of the body (whether left or right), the contralateral 
side is the opposite side (whether left or right), while the ipsilateral side is the original side 
in question. 

6 As far as this discussion is concerned, the phrase each possible location only refers to 
locations in one half of the plane, since for example, the time delay associated with a source 
being 15 degrees from the azimuth, will be the exact same as the time delay associated 
with the source being 165 degrees from the azimuth. The brain actually has other ways of 
detecting whether a signal is coming from the anterior or posterior, but this discussion is 
omitted for now. 
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in Fig. 3, if there was a big time delay between the left and right ears receiv- 
ing the signal (i.e. the source's angle to the azimuth was close to 90 degrees), 
neuron 8 or 9 would probably be activated, since the long time delay due to 
the temporal differentiation would have to be negated by a shorter time delay 
from the axon (i.e. a shorter axon length). 
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Figure 3: A display of the transmission of acoustic information from the envi- 
ronment to the location where processing of temporal differentiations occurs. 
Although only one box for the medial superior olive is shown, signals are actu- 
ally sent to both the left and right medial superior olives from each ear. The 
chronological direction of flow in this chart is denoted by the neurons between 
each box (from dendrites to axons in all cases). 
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In Fig. 3, neuron 5 is denoted to be activated, due to the fact that the 
stimulus probably came from either degrees or 180 degrees from the azimuth. 
Of course, in the actual brain there are many more neurons than depicted in 
the diagram (each representing a particular possible location of the sound 
source) . 



2.1.2 Tuning Curves 

In accordance with the above description, each neuron of the medial su- 
perior olive is most sensitive to a particular range of possible time differences. 
Consequently, each neuron will have two specific "preferred locations" for the 
sound source (remembering that there are two possible locations for the sound 
source in Fig. 1 that will result in each particular length of time delay - 
one in the half-plane containing the direction of the azimuth, and one in the 
half-plane containing the negative of that direction). 

The firing rate of a particular neuron will be highest when the sound source 
is positioned in one of these two "preferred locations" and would ideally be zero 
when the sound source is in any other location. However, it is indeed possible 
that the firing rate will be non-zero for certain sound locations near enough 
to the preferred location. Referring to the example in Fig. 3 for instance, 
the signal from the contralateral side can reach the medial superior olive soon 
enough for neuron 6 to fire, or late enough for neuron 4 to fire. 

Consequently, the tuning curves have been modeled as narrow Lorentzian 
curves with peak firing rates chosen between 114/sec and 268/secQ and re- 
flected to be symmetric about | radians. Fig. 4 shows the tuning curve for 
one of these neurons randomly selected from an ensemble of 4000 neurons - 
it's maximal firing rate is approximately 170 spikes/second and it's preferred 
directions are approximately 3.28 radians and 6.14 radians from the azimuth. 
Notice how the preferred directions are symmetric about | radians, and the 
abscissa of the plot is labeled with | radians in the centre. 

Fig. 5 displays the tuning curves of 50 neurons chosen out of the ensemble 
of 4000, and Fig. 6 displays the entire ensemble of neurons. The preferred 
locations are randomly chosen between and 2ir radians, with (HWHM)^] 
randomly chosen between 1 and 2.5 degree^ and with amplitudes randomly 



7 This is because neurons on the medial superior olive have been recorded with firing 
rates in this range. [3] 

8 Half-widths at half-heights 

9 This is based on the range of precision with which the neurons of the medial superior 
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Figure 4: The tuning curve for one neuron chosen out of an ensemble of 4000, 
showing it's maximal firing rate and it's preferred directions. 

chosen between 114 and 268. 

In Fig. 6 we can see that with 4000 neurons, almost every possible sound 
source location has a corresponding neuron with maximal firing rate at that 
location. Of course, it would be impossible to have a separate neuron for each 
of the infinite possible sound source locations between and 2tt radians - the 
small separation between the peaks of the curves in Fig. 6 leads to the fact 
that the human brain can only locate sound sources to within a few degrees 
of precision [2]. 

2.1.3 Variable Specifications and Mathematical Description of the 
System Funciton 

For this simulation, the variable x will denote the angle that the sound 
source makes with the azimuth. The encoding of this variable by each individ- 
ual neuron (i) will be expressed by the Lorentzian neural response functions 
(as depicted in Figs. 4-6): 

olive tend to estimate sound locations based on coincidence detection. See [2] for a reference. 
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Tuning Curves for 50 Nuerons 
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Angle with respect to the azimuth in radians 

gure 5: The tuning curves for 50 neurons chosen out of an ensemble of 4000. 

Tuning Curves for 4000 Nuerons 
300 I 1 1 1 -i 1 1 1 




Angle with respect to the azimuth in radians 

Figure 6: The tuning curves for all 4000 neurons in the ensemble. 
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7r [x - x ) + (frj 2 2 2 



1 -T 3n Ti 
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7T (x - xq) z + {^ry 2 2 



(2) 



Here, -4? represents the peaks of the Lorentzians and xo represents the 
preferred direction for a particular neuron. The neurons whose preferred sound 
source locations are to the right of the human detector when he or she is facing 
forward (i.e. between and ^ radians) are labeled herein as 'on' neurons 
whereas those most sensitive to sound source locations on the left are labeled 
as 'off' neurons. 



2.2 Design Specification 

2.2.1 Range 

The range for the variable x described above is to 2tt radians. These 
values are not normalized to range from to 1. Rather, they are kept as angles 
measured clockwise from the azimuth in radians. The purpose of choosing this 
convention will become clear in subsequent sections of this paper. 

2.2.2 Signal-to-Noise Ratios (SNR) 

Since neurons carry about 3 bits of information per spike pQ. As mentioned 
in Ref. [I], this means that the coding allows approximately 10% error, and 
thus, we can assume that the neurons encode information wtih a signal-to-noise 
ratio (SNR) of approximately 10:1. A 10% error corresponds to a variance of 
ex 2 = 0.1 if the distribution of signals is even. Accordingly, I will assume that 
the noise is independent and Gaussian distributed with a mean of zero and a 
variance of 0.1 for each neuron in the model, 
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10 More information regarding the transmission of signals by individual neurons is pre- 
sented in Ref. jl]. 
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2.2.3 Precision 



For the precision of the neural representations, it is difficult to predict 
how precise this model will be before it is implemented, and therefore, I will 
consider determining the precision of the representation as one of the purposes 
of this model. In the implementation section, the relationship between the 
number of neurons and the precision of the representation will be explored. 

2.3 Implementation 

2.3.1 Decoding Rules 

The decoders used in this section are "optimal linear decoders." The 
decoding can be summarized by the following formula: 

JV 

x = y^(aj(a;) + rdfa (3) 
i=i 

Here, x is the estimation of the signal x, N is the number of neurons, % is 
an index indicating the description of a particular neuron, ai is the firing rate 
of neuron i, rji is the noise associated with neuron i, and fa are the optimal 
linear decoders given below. 

= r _1 T (4) 

where, each element of T is given by: 

Tij — / ai(x)aj(x)dx + a 2 5ij (5) 
Jo 

and, each element of T is given by: 

/■27T 

Tj = J a,j(x)xdx. (6) 
Jo 

A detailed derivation of the above expressions is demonstrated in Ref. [I]. 

2.3.2 Results 

The first implementation of this model was in decoding the signal x using 
4000 neurons. Fig. 7 shows the decoded estimate x and the original signal x 
over the range of x. The root mean square deviation was 0.011. The difference 
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between the decoded estimate and the original signal is plotted over the range 
of x in Fig. 8 (notice the change in the scale of the ordinate axis). 



The decoded signal and the original signal (Xhat and X) 



3tJ2 




Figure 7: Decoded estimate x and original signal x over the range of x. 



2.3.3 Numerical Experiments 

The next item for which this model was implemented was numerical ex- 
perimentation. The first of these numerical tests was implemented with the 
objective to determine the relationship between the number of neurons and 
the error due to noise and static distortion. The relationship for the error due 
to noise should decrease as 4 (where N is the number of neurons) according 
to previous results [5][6][T]. Fig. 9 verifies this expected result, and Fig. 10 
displays the error due to static distortion against the number of neurons. 

The next numerical experiment performed with this model compares the 
above results (using Lorentzian tuning curves) to results using the Leaky 
Integrate-and-Fire (LIF) model and Rectified Linear Models. 

The plots of x and x for these two models are omitted due to their close 
resemblance to the plots of Fig. 7 and Fig. 8 above. The RMS deviation for 
the LIF model however, was only 0.0053; and for the Rectified Linear model, it 
was only 0.0027. This suggests that the severe non-linearity in the Lorentzian 
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Difference between the decoded signal and the original signal (Xhat - X) 




Figure 8: Differences between the decoded estimate and the original signal 
(x — x) over the range of x. 
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Log of Square Error (due to noise) vs Log of Number of Neurons 
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Figure 9: Error due to noise with increasing number of neurons. 
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Log of Square Error (due to distortion) vs Log of Number of Neurons 
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Figure 10: Error due to static distortion with increasing number of neurons. 

tuning curves decreases the precision of their decoding of signals using optimal 
linear decoders. Nevertheless, the Lorentzian tuning curves were chosen for 
the biological plausibility argument presented above, and since they do not 
perform extremely worse than LIF neurons or Rectified Linear neurons in 
decoding the above signal, I have chosen to continue with using tuning curves 
of this form for the remainder of this paper. 

The final numerical experiment performed with this model compares the 
above results (using Lorentzian tuning curves) where the decoders are optimal 
linear decoders, to results in which the decoders used were the preferred direc- 
tions. A similar approach for defining the decoders was used in Georgopoulos' 
model of arm movements in monkeys [I]. The result of this numerical exper- 
iment is displayed in Fig. 11, and it demonstrates that these decoders to 
not work very well. The RMS deviation in this case was approximately 0.73, 
which is higher than that of the optimal linear decoders case by a factor of 
approximately 66.36. 

In accordance with the above results, the remainder of this paper will use 
optimal linear decoders and Lorentzian tuning curves. 
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Decoded signal (Xhat), original signal (X), and their difference fXhat-X) using prefered directions as decoders 
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Figure 11: Decoding using preferred directions rather than optimal linear de- 
coders. 

3 1-Dimensional Representation: Equiangu- 
lar Signals Varying in the Radial Dimension 

3.1 System Description 

3.1.1 Connectivity and Functional Functional Description 

In this section, the sound source will no longer vary in angle, but will 
rather vary in radial distance away from the human that is decoding it. The 
sound source considered is one that emits sounds at a constant frequency 
and intensity (therefore, the human decoder should perceive that sound being 
louder when the radial distance is small, and softer when the radial distance 
is large). 

The brain's mechanism for detecting differences in volume (sound intensity) 
relies on the fact that the sound intensity entering the ipsilateral ear is greater 
than the sound intensity entering the contralateral ear (assuming that the 
head is spherical). This is because sound intensity in the radial direction for 
a spherical sound source is given by[H]: 
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J(r) 



P 



(7) 



47rr 



where I(r) is the intensity, P is the power, and r is the radial distance 
away from the centre of the source. This inverse-square relationship, although 
defined for spherical sound sources, holds approximately true for non-spherical 
source's as well. As depicted in Fig. 12, for a given angle, as the sound source 
moves farther away from the head, the amount of space through which the 
sound waves travel on their way to the contralateral ear consistently remains 
larger than that of the ipsilateral ear, but the difference in path lengths de- 
creases. This means that the intensity difference should get smaller as the 
sound source moves farther away from the detector's head. 



Figure 12: Sound intensity difference decreasing as the sound source is taken 
radially farther from the detector's head. 



On the ipsilateral side, the neurons of the posteroventral cochlear nucleus^ 
transmit signals to the nucleus of the trapezoidal body, and neurons of the 
trapezoidal body send signals to the lateral superior olive. On the contralateral 
side, the neurons of the cochlear nucleus transmit signals directly to the nucleus 

11 As opposed to the angular sound localization discussed in the previous section, where 
the signals are instead sent from the anteroventral cochlear nucleus. 
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of the lateral superior olive [2]. Notice that if the sound source was colinear 
with the azimuth, there would be no difference in intensity between the left 
and right ears (again, assuming the head is a perfect sphere). In this case 
the distinction between contralateral and ipsilateral ears cannot be made, and 
that is why the brain first locates the sound source's angle with respect to the 
azimuth (as described in the above section). Fig. 13 organizes and summarizes 
this information in a flow chart. 
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Figure 13: A flow chart depicting how the acoustic signals from the environ- 
ment reach the lateral superior olive, where radial location of the sound source 
occurs. 

The neurons of the lateral superior olive now have signals entering from 
both the ipsilateral and contralateral ears, and each neuron requires a certain 
amount of input current in order to be excited. The ones that require the 
most input current in order to be excited are the ones that are sensitive to 
signals that are radially close from the detector (where the sound intensity 
is greatest, and therefore the neurons have evolved to "expect" more input 
current). These neurons will be considered 'off' LIF neurons, since their firing 
rate is expected to increase as their radial distance from the signal decreases 
(just like the LIF tuning curve shapes). The ones that require the most input 
current will be the ones with low-valued x-intercepts, and the ones that don't 
require as much will be the ones with larger x-intercepts (their firing rate will 
still be non-zero even for low intensities and far radial distances). 

3.1.2 Tuning Curves 

3.1.3 Variable Specifications and Mathematical Description of the 
System Function 

The variable specifications are the same as before except instead of using 
Lorentzian tuning curves we use 'off' LIF neural response functions. The 
decoding equation is the same as above. 
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Off LIF neurons representing the radial distance of the signal from the centre of the head 
200 I 1 1 1 1 1 1 1 1 1 
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Figure 14: LIF off neurons representing radial distance from the centre of the 
head. 

3.2 Design Specification 

3.2.1 Range 

The range for the radial distances will be 15cm to 50cm. 

3.2.2 Signal to Noise Ratio and Precision 

As before, we assume the signal to noise ratio is 10:1, so the noise will be 
modeled as independent, Gaussian distributed noise with a mean of zero and 
a variance of 0.1. The precision, as in the last section, will be considered the 
purpose of the simulation. 
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4 Feed-Forward Representation 



The first model discussed above dealt with signals about the sound source's 
angle with respect to the azimuth being sent through the anteroventral cochlear 
nuclei to the nucleus of the medial superior olive. The second model discussed 
above dealt with signals about the sound source's radial distance away from the 
detector being sent through the posteroventral cochlear nuclei to the nucleus 
of the lateral superior olive. Of course, to be able to pinpoint the location of 
an object, one needs information about both of these positions (both angular 
and radial). This section describes the amalgamation of these two items of 
information. 

4.1 System Description 

4.1.1 Connectivity and Functional Functional Description 

The final representation of the location of an object can be thought of as a 
2-component vector where one component is the representation of the angular 
position, while the other component is the representation of the radial position. 
The most important location where the representation of this information is 
stored is in the neurons of the primary auditory cortex [2J. The pathway is 
outlined in Fig. 14. The nuclei of the medial and lateral and superior olive 
send signals to the neurons of the inferior colliculus, which send signals to the 
medial geniculate nucleus, which sends signals to the primary auditory cortex. 

In the network displayed above, the representation of a signal by one pop- 
ulation of neurons, serves as the signal to the next population. Then the 
representation of that representation, serves as the signal for the next popu- 
lation, and so on. 

4.1.2 Tuning Surfaces 

As described in [2J, specific neurons represent specific spatial locations 
of sound sources (both angular and radial), so certain neurons will have high 
firing rates when the sound source is a certain angle and distance away, and 
should have low firing rates otherwise. For this reason, the tuning surfaces 
were taken to be 2-dimensional Gaussian functions, with peak firing rates 
chosen to be between 114 and 268 spikes/second (as described in section 1). 
An example of one of these tuning surfaces is shown in Fig. 16. It's peak is 
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Figure 15: A chart displaying the flow of acoustic information through various 
locations in the brain on its way to the primary auditory cortex. The lower 
figure is a screenshot from the neural engineering software NENGO that was 
used for this element of the simulation (NENGO stands for Neural Engineering 
Objects: www.nengo.ca) 



found at approximately (x,y) = (0.25,0.56) and is therefore most sensitive to 
sound sources at that location: 

Fig. 17 displays tuning surfaces for 15 different neurons, all with different 
preferred locations. 

4.1.3 Variable Specifications and Mathematical Description of the 
System Function 

For this model, the variables x and y will represent spatial locations in a 
2-Dimensional flat plane (parallel to the detector's transverse plane). 

The encoding of these variables by each neuron will be expressed as 2- 
dimensional Gaussian tuning surfaces given by the neural response functions 
(as depicted in Figs. 16 and 17): 
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Tuning surface for one neuron as a function of x- and y-coordinates (Normalized between -2 and 2) 



120 --, 




Figure 16: The tuning surface for one neuron whose preferred location is 
approximately (x,y) = (0.25,0.56) . 

Oi(x,y) = FRe- {x - x > )2 - (y - y > )2 (8) 

where FR is the peak firing rate (picked between 114 and 268 spikes/second) 
and xq and yo are the coordinates of the preferred locations of the ith neuron 
(each chosen between -2 and 2). 
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Tuning surfaces for 15 neurons functions of x- and y-coordinates (Normalized between -2 and 2) 




Figure 17: The tuning surface for 15 neurons whose preferred locations are 
randomly distributed between over the plane: —2 < x < 2, —2 < y < 2. 



4.2 Design Specification 
4.2.1 Range 

The range of x (the angular component running from to 2n radians) 
and y (the radial component running from 15cm-50cm) are both normalized 
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to be between -2 and 2. 

4.2.2 Signal-to-Noise Ratios (SNR) and Precision 

As before, we assume that the signal-to-noise ratio is approximately 10:1, 
and therefore the noise will be independent, Gaussian distributed, and with 
a mean of and variance of 0.1 for each neuron in the model. Likewise, the 
precision of the decoding will be one of the purposes of the simulations. 

5 Extrapolation: Spatially- Varying Sound Ve- 
locities 

The neurons of our primary auditory cortex have evolved to represent 
specific locations for sound sources. So if the sound source is at one particular 
location, there are particular neurons that are most sensitive to the stimulus 
that will fire, whereas if the sound source was moved, different neurons would 
be more sensitive. These neurons are getting their signals from the neurons of 
the superior olivary complex, who (in the case of detecting angular positions 
of sound sources) fire based on certain time differences between the signal 
reaching different ears. A hypothetical example would be, if the right ear 
attained the signal 500 fis before the left ear, it might mean that the angle the 
sound source made with the azimuth was | radians, and the neurons of the 
superior olivary complex would send signals to the primary auditory cortex 
informing them that the signal was coming from that direction. 

This will cause problems if the left ear is in water and the right ear is in 
air, as in the case of someone floating sideways in a swimming pool. This is 
because the speed of sound in water is slower than the speed of sound in air, 
so the time difference might be delayed even more than usual. However, the 
neurons of the primary auditory cortex (as far as we understand them) do not 
"know" that one ear is in water and one is in air, so they will take this extended 
time difference to dictate that the sound source was at a greater angle away 
from the detector, while in reality the angle could have been smaller. 

Now that I have constructed a model that is able to locate the position 
of a sound source to a reasonable degree of accuracy, where the sound source 
is assumed to be of constant frequency and intensity, and is assumed to emit 
sound waves that are transmitted through a uniform medium (such as air), this 
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model is able to predict the errors that the brain will make when attempting 
to locate signals when one ear is in water and one ear is in air. 
The speed of sound in a fluid is given by: 



where p is the density of the fluid and K is the bulk modulus of the fluid [8]. 

Here we assume that the speed of flow of the water is much less than the 
speed of sound in the water (so that the additive resultant velocity of sound is 
not much different from the speed of sound itself). Suppose that the density 
of the water is 1-^=? and that the Bulk Modulus is 1.056xl0 9 ^- so that the 
speed of sound in the fluid is approximately 325™ as opposed to 343 — in 
air. This means, assuming a spherical head of diameter 10cm, and a sound 
source making a ^ radian angle with the azimuth and 15cm away from the 
centre of the head, the length of the path of the sound to the left ear will 
be approximately 20.98cm and to the right ear would be 11.05cm (by the 
Law of Cosines). We can calculate using similar triangles that on its way to 
the left ear, the sound travels approximately 11.85cm of the 20.98cm path in 
water, and the remaining 9.13cm in air. Using 32500cm/second for water and 
34300cm/second for air, we get a total time of 0.63/is, whereas for the right ear 
the path is completely in water, so the time it takes is 0.34/xs. This temporal 
difference of 0.29/is is in normal circumstances caused by the source being at 
an angle of |j to the azimuth (assuming its radial distance is still 15cm). This 
is depicted in Fig. 18 below. The blue dot at depicts the actual sound 
source and the red dot is the human's approximation of it. 

Similarly, several other errors are calculated and included in the same fig- 
ure. 




(9) 
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Figure 18: Errors made by the brain in detecting locations of sounds. The 
sound sources are denoted in blue and are 15cm radially outward from the 
centre of the spherically shaped head. The head's diameter is 10cm. The red 
dots are the brain's (incorrect) estimate of the location from which the sound 
is arriving. 
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